Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Implementation of an online database for tables of contents of books

Identifieur interne : 002300 ( Main/Exploration ); précédent : 002299; suivant : 002301

Implementation of an online database for tables of contents of books

Auteurs : M. Jell [Allemagne] ; B. Reuse [Allemagne] ; G. Kessling [Allemagne]

Source :

RBID : Pascal:98-0231108

Descripteurs français

English descriptors

Abstract

Many small libraries do not have the resources to build a holdings database. Thanks to the availability of affordable scanners and improved OCR software, a new approach for creating an online database is possible. This database is filled through a series of stages. First, the book information and table of contents pages are scanned and converted to text using OCR software. Then, a computer program is used to extract as much information as possible, with a human making corrections and supplying missing information. Finally, the information, which consists of the title, author, ISBN, publication year, call number and other relevant information for books, as well as the entire table of contents, is stored and added to an Ovid database.


Affiliations:


Links toward previous steps (curation, corpus...)


Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en" level="a">Implementation of an online database for tables of contents of books</title>
<author>
<name sortKey="Jell, M" sort="Jell, M" uniqKey="Jell M" first="M." last="Jell">M. Jell</name>
<affiliation wicri:level="3">
<inist:fA14 i1="01">
<s1>Max-Planck-lnstitut für biophysikalische Chemie</s1>
<s2>37070 Göttingen</s2>
<s3>DEU</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>Allemagne</country>
<placeName>
<region type="land" nuts="2">Basse-Saxe</region>
<settlement type="city">Göttingen</settlement>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Reuse, B" sort="Reuse, B" uniqKey="Reuse B" first="B." last="Reuse">B. Reuse</name>
<affiliation wicri:level="3">
<inist:fA14 i1="01">
<s1>Max-Planck-lnstitut für biophysikalische Chemie</s1>
<s2>37070 Göttingen</s2>
<s3>DEU</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>Allemagne</country>
<placeName>
<region type="land" nuts="2">Basse-Saxe</region>
<settlement type="city">Göttingen</settlement>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Kessling, G" sort="Kessling, G" uniqKey="Kessling G" first="G." last="Kessling">G. Kessling</name>
<affiliation wicri:level="3">
<inist:fA14 i1="01">
<s1>Max-Planck-lnstitut für biophysikalische Chemie</s1>
<s2>37070 Göttingen</s2>
<s3>DEU</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>Allemagne</country>
<placeName>
<region type="land" nuts="2">Basse-Saxe</region>
<settlement type="city">Göttingen</settlement>
</placeName>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">INIST</idno>
<idno type="inist">98-0231108</idno>
<date when="1998">1998</date>
<idno type="stanalyst">PASCAL 98-0231108 INIST</idno>
<idno type="RBID">Pascal:98-0231108</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000896</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000B01</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000830</idno>
<idno type="wicri:doubleKey">0264-0473:1998:Jell M:implementation:of:an</idno>
<idno type="wicri:Area/Main/Merge">002424</idno>
<idno type="wicri:Area/Main/Curation">002300</idno>
<idno type="wicri:Area/Main/Exploration">002300</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a">Implementation of an online database for tables of contents of books</title>
<author>
<name sortKey="Jell, M" sort="Jell, M" uniqKey="Jell M" first="M." last="Jell">M. Jell</name>
<affiliation wicri:level="3">
<inist:fA14 i1="01">
<s1>Max-Planck-lnstitut für biophysikalische Chemie</s1>
<s2>37070 Göttingen</s2>
<s3>DEU</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>Allemagne</country>
<placeName>
<region type="land" nuts="2">Basse-Saxe</region>
<settlement type="city">Göttingen</settlement>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Reuse, B" sort="Reuse, B" uniqKey="Reuse B" first="B." last="Reuse">B. Reuse</name>
<affiliation wicri:level="3">
<inist:fA14 i1="01">
<s1>Max-Planck-lnstitut für biophysikalische Chemie</s1>
<s2>37070 Göttingen</s2>
<s3>DEU</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>Allemagne</country>
<placeName>
<region type="land" nuts="2">Basse-Saxe</region>
<settlement type="city">Göttingen</settlement>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Kessling, G" sort="Kessling, G" uniqKey="Kessling G" first="G." last="Kessling">G. Kessling</name>
<affiliation wicri:level="3">
<inist:fA14 i1="01">
<s1>Max-Planck-lnstitut für biophysikalische Chemie</s1>
<s2>37070 Göttingen</s2>
<s3>DEU</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>Allemagne</country>
<placeName>
<region type="land" nuts="2">Basse-Saxe</region>
<settlement type="city">Göttingen</settlement>
</placeName>
</affiliation>
</author>
</analytic>
<series>
<title level="j" type="main">Electronic library</title>
<title level="j" type="abbreviated">Electron. libr.</title>
<idno type="ISSN">0264-0473</idno>
<imprint>
<date when="1998">1998</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt>
<title level="j" type="main">Electronic library</title>
<title level="j" type="abbreviated">Electron. libr.</title>
<idno type="ISSN">0264-0473</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Book</term>
<term>Database</term>
<term>Description</term>
<term>Implementation</term>
<term>Methodology</term>
<term>On line</term>
<term>Table of contents</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr">
<term>Base donnée</term>
<term>Sommaire</term>
<term>Livre</term>
<term>Implémentation</term>
<term>En ligne</term>
<term>Méthodologie</term>
<term>Description</term>
</keywords>
<keywords scheme="Wicri" type="topic" xml:lang="fr">
<term>Base de données</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Many small libraries do not have the resources to build a holdings database. Thanks to the availability of affordable scanners and improved OCR software, a new approach for creating an online database is possible. This database is filled through a series of stages. First, the book information and table of contents pages are scanned and converted to text using OCR software. Then, a computer program is used to extract as much information as possible, with a human making corrections and supplying missing information. Finally, the information, which consists of the title, author, ISBN, publication year, call number and other relevant information for books, as well as the entire table of contents, is stored and added to an Ovid database.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>Allemagne</li>
</country>
<region>
<li>Basse-Saxe</li>
</region>
<settlement>
<li>Göttingen</li>
</settlement>
</list>
<tree>
<country name="Allemagne">
<region name="Basse-Saxe">
<name sortKey="Jell, M" sort="Jell, M" uniqKey="Jell M" first="M." last="Jell">M. Jell</name>
</region>
<name sortKey="Kessling, G" sort="Kessling, G" uniqKey="Kessling G" first="G." last="Kessling">G. Kessling</name>
<name sortKey="Reuse, B" sort="Reuse, B" uniqKey="Reuse B" first="B." last="Reuse">B. Reuse</name>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 002300 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 002300 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     Pascal:98-0231108
   |texte=   Implementation of an online database for tables of contents of books
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024